首页> 外文OA文献 >Significance tests and weighted values for AFLP similarities, based on Arabidopsis in silico AFLP fragment length distributions.
【2h】

Significance tests and weighted values for AFLP similarities, based on Arabidopsis in silico AFLP fragment length distributions.

机译:基于拟南芥计算机AFLP片段长度分布的AFLP相似性的显着性检验和加权值。

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Many AFLP studies include relatively unrelated genotypes that contribute noise to data sets instead of signal. We developed: (1) estimates of expected AFLP similarities between unrelated genotypes, (2) significance tests for AFLP similarities, enabling the detection of unrelated genotypes, and (3) weighted similarity coefficients, including band position information. Detection of unrelated genotypes and use of weighted similarity coefficients will make the analysis of AFLP data sets more informative and more reliable. Test statistics and weighted coefficients were developed for total numbers of shared bands and for Dice, Jaccard, Nei and Li, and simple matching (dis)similarity coefficients. Theoretical and in silico AFLP fragment length distributions (FLDs) were examined as a basis for the tests. The in silico AFLP FLD based on the Arabidopsis thaliana genome sequence was the most appropriate for angiosperms. The G + C content of the selective nucleotides in the in silico AFLP procedure significantly influenced the FLD. Therefore, separate test statistics were calculated for AFLP procedures with high, average, and low G + C contents in the selective nucleotides. The test statistics are generally applicable for angiosperms with a G + C content of approximately 35-40%, but represent conservative estimates for genotypes with higher G + C contents. For the latter, test statistics based on a rice genome sequence are more appropriate.
机译:许多AFLP研究包括相对不相关的基因型,这些基因型将噪声贡献给数据集而不是信号。我们开发了:(1)估计不相关基因型之间的AFLP相似性的估计,(2)AFLP相似性的显着性检验,能够检测不相关的基因型,以及(3)加权相似系数,包括谱带位置信息。对不相关基因型的检测和加权相似系数的使用将使对AFLP数据集的分析更加有益和可靠。针对共享频段的总数以及Dice,Jaccard,Nei和Li的测试统计数据和加权系数,以及简单的匹配(不相似)相似系数进行了开发。理论上和计算机上AFLP片段长度分布(FLD)被作为测试的基础。基于拟南芥基因组序列的计算机AFLP FLD最适合被子植物。计算机AFLP过程中选择性核苷酸的G + C含量显着影响FLD。因此,针对AFLP程序计算了单独的测试统计数据,这些程序在选择性核苷酸中具有高,平均和低G + C含量。测试统计数据通常适用于G + C含量约为35-40%的被子植物,但代表了G + C含量较高的基因型的保守估计。对于后者,基于水稻基因组序列的检验统计数据更为合适。

著录项

  • 作者

    Koopman, Wim J M; Gort, Gerrit;

  • 作者单位
  • 年度 2004
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类
  • 入库时间 2022-08-20 20:37:09

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号